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The system-wide study of proteins presents an exciting challenge in this information-rich age of whole-genome 
biology. Although traditional investigations have yielded abundant information about individual proteins, they 
have been less successful at providing us with an integrated understanding of biological systems. The promise of 
proteomics is that, by studying many components simultaneously, we will learn how proteins interact with each 
other, as well as with non-proteinaceous molecules, to control complex processes in cells, tissues and even whole 
organisms. Here, I discuss the role of microarray technology in this burgeoning area. 



As the scope of proteomics is extrcinch- broad, it is helpful to dis- 
tinguish among its various aspects before tliscussiiig how 
microarray technology is lilcely to feature. Over the past decade, 
the marriage of two technologies — two-dimensional gel elec- 
trophoresis and mass spectrometry — has spawned what I refer to 
as 'unbiased' or 'discovery-oriented proteomics' (Fig. 1). In a 
typical discovery- oriented experiment, a biological sample is 
analyzed by separating, tjuantifying and identiiying as many pro- 
teins as possible, often with an emphasis on those proteins with 
altered abundance relative to a reference sample. The approach is 
unbiased because the investigator does not impose his or her 
knowledge of biology on the experimental design; it is discovery- 
oriented because unknown proteins, as well as known proteins, 
may be identified in the course of the experiment. 

Between this and traditional biochemistry lies 'focused' or 
'systems- oriented proteomics' (Fig. 1). Here, the investigator 
directs the study by defining a subset of proteins to be analyzed, 
such as a family of proteins related by sequence or a collection of 
proteins related by function. Although microarray technology 
will undoubtedly have an important role in discovery-oriented 
proteomics, it is particularly well suited to systems- oriented 
investigations. 

Another important distinction in proteomics concerns the 

objective of the study. In one type of experiment, the investiga- 
tor aims to analyze a particular biological sample such as a 
tumor biopsy from an individual with cancer. For such experi- 
ments, a complete 'proteomic analysis' would involve measuring 
the abundance, modification, activity, localization and interac- 
tion of all the proteins in that sample. In practice, current tech- 
nologies limit our analysis to only one or two of these 
parameters and to only a fraction of the proteins. This type of 
experiment is the protein equivalent of expression profiling and 
is broadly known as 'protein profiling'. I refer to this aspect of 
proteomics as 'quantitative proteomics' to distinguish it from 
the second type of investigation, 'functional proteomics', in 
which one seeks instead to define the function of every protein 
in a given organism. This may involve, for example, identifying 
putative substrates for enzymes or putative interactions between 
proteins. It is already clear that microarray technology will have 
a significant impact on both quantitative proteomics and func- 
tional proteomics, and in this review I address each of these 
areas in turn. 



Quantitative proteomics 

Microarray technology is linding its way into quantitative pro- 
teomics through the construction of what are most accurately 
called 'protein-detecting microarrays'^ A protein-detecting 
microarray comprises many different affinity reagents (fre- 
quently antibodies) arrayed at high spatial density on a solid sup- 
port. Each agent captures its target protein from a complex 
mixture (such as serum or cell lysate), and the captured proteins 
are subsequently detected and quantified. 

Historically, the roots of protein-detecting microarrays lie in 
the development of inniiunoassays. As early as 1929, antibodies 
were used in serology to precipitate antigens for subsequent 
quantification^. Analytical immunoassay technology was greatly 
advanced with the introduction of the radioimmunoassay (RIA) 
by Yarlow and Berson^ in 1959 and the enzyme-linked 
immtinosorbant assay (ELIS A) by Engvall and Perlman* in 1 97 1 . 
In the late 1980s, Roger Ekins argued that extremely sensitive, 

'niicrospots' of antibodies on soUd supports and thai niultiana- 
lyte assays could be carried out using spatially separated arrays of 
such spots^. But it was not until the late 1990s — when the bene- 
fits of DNA microarray technology were well recognized and the 
equipment to manufacture microarrays became more 
accessible — that attention shifted to the development of anti- 
body microarrays for the purpose of quantitative proteomics. 
Sandwich immunoassays. One of the earliest reports demon- 
strating the feasibility of multianalyte, microarray-based 
immunoassays was published by Silzel et al.^ in 1998. Using a 
standard Inkjet printer, they spotted monoclonal antibodies 
directed against the four human immunoglobulin-y (IgG) 
subclasses onto a thin sheet of polystyrene film to form spots 
with a diameter of 200 |am. Using a sandwich-based assay (Fig. 
2a), they were able to show subclass-specific recognition of 
human myeloma proteins with minimal crossreactivity and to 
observe dose-dependent signals for each subclass. Although 
the spotting methods were relatively crude and difficult to 
scale up, this early study showed that several proteins could be 
detected and quantified in a parallel fashion using antibody 
microarrays. 

In one of the first studies to show that multiplexed sand- 
wich immunoassays could be used to analyze biological sam- 
ples. Moody et al7 fabricated arrays of seven different antibodies 
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against cytokines in a 3 X 3 pattern in 96-well polystyrene 
microtiter plates. They then used these arrays to monitor the 
abundance of cytokine production by THP- 1 cells on stimulation 
with Iipopolysaccharide, either alone or in combinalioii wiili ilic 
anti-inflammatory drug dexamethasone. Captured cMokiiies 
were probed with a cocktail of detection antibodies, and tlie sig- 
nal was amplified by enzyme- catalyzed chemikiinmescence. Ihis 
provided the sensitivity needed to detect cytokine concentrations 
of less than a picogram per milliliter. Shortly thereafter, Huang et 
al.^ reported fabricating hand-spotted arrays of antibodies on 
nitrocellulose membranes and showed that they could detect up 
to 24 different c}'tokines in both conditioned media and human 
serum in a multiplexed sandwich assay, also using a chemilumi- 
nesccnl detection system. Several other groups have suhse- 
quenlly reported antibody arrays that measure human cytokines 
(see, lbrexanipfe,rels 9-11). 

In the most extensive study carried out so far, Schweitzer 
et al}^ fabricated microarrays of 75 antibodies against cytokines 
on chemically derivatized glass slides and used these arrays to 
study cytokine secretion from human dendritic cells induced 
with Iipopolysaccharide or tumor- necrosis factor-a. To increase 
Ihe sensilivily of liicir assay and retain high spatial resolution 
(not afforded by enzyme- catalyzed chemiluminescciicc), they 
used isothermal rolling- circle amplification'^ to develop their 
sandv«ch assays. Particularly notable is the extent to which they 
were able to multiplex this technique. Some of the detection anti- 
bodies were found to crossreact vnth other cytokines on the 
array. To avoid this problem, Schweitzer etal}'^ divided their col- 
lection of capture antibodies into two roughly equal-sized 
groups. Each group of antibodies was spotted on a different por- 
tion of the same glass slide separated by a Teflon barrier. In this 
way, spurious signals generated by antibody crossreactivity could 
be avoided. This strategy of subdivision represents a potentially 
general solution to some of the problems inherent in multiplexed 
sandwich assays (see below). 

It is no coincidence that almost every sandwich microarray- 
based assay that has been reported is directed at the analysis of 
human cytokines. There are two reasons for this. First, matched 
pairs of high- affinity, high-speciticit}- anlihixlics lia\c Ixcn 
developed and commercialized over the past sc\ci al decades lor 
use in diagnostic kits based on sandwich ELISAs. The availability 
of high-performance, validated reagents circumvents the largest 
hurdle in developing microarray-based assays. Second, cytokines 
are secreted fi-om cells; thus, analyses are done on relatively sim- 
ple mixtures in comparison to cellular lysates. 

In experiments carried out in collaboration with Peter Sorger, 
my research group has found that it is much more difiicult to 
find antibodies to study intracellular proteins. In fact, only about 
5% of over 100 commercial antibodies that we have tested are 
suitable for microarray-based analyses of cellular lysates (U.B. 
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Fig. 1 The spectrum of protein analysis, ranging from traditional biochemistry 
to unbiased proteomics. Focused proteomics occupies a middle ground, where 
one seeks to maximize both the depth and the breadth of biochemical investi- 



Nielsen, M.H. Cardone, A.J. Sinskey, G.M. and RK. Sorger, 
unpublished data). Nevertheless, we have been able to develop 
arrays lhal report on both the abundance and the modification 
state ol intracellular signaling proteins, including integral mem- 
brane receptors. We have also developed a ratiometric quantifi- 
cation method in which one detection antibody, labeled with a 
lluorophore, reports on the 'abundance' of the captured protein, 
while a second detection antibody, labeled with a different col- 
ored fluorophore, reports on a specific post-translational 'modi- 
fication' state of that protein. The resulting ratio of fluorescence 
provides a measure of the fraction of protein in the lysate that has 
been modified (for example, phosphorylated; U.B. Nielsen, M.H. 
Cardone, A.J. Sinskey, G.M. and P.K. Sorger, unpublished data). 
We anticipate that this and similar strategies will be particularly 

Antigen capture immunoassays. In an alternative approach to 
the sandwich assay, it is possible to detect captured antigens 
without using a second antibody (Fig. Ih). This requires either a 
label-free detection method (such as mass spectrometry or sur- 
face plasmon resonance) or the chemical labeling of all proteins 
in a sample. In one of the earliest and most extensive attempts to 
examine the feasibility of a two-color labeling approach, Haab 
el al}^ assembled a collection of 115 antibodies and their protein 
ligands. They prepared defined mixtures of the antigens, labeled 
the niLXtures with activated Cy5 dye, and combined each mixture 
with a Cy3-labeled reference mixture. The proteins were then 
applied to antibody microarrays comprising the 115 antibodies 
spotted at high density onto slides coated with poly-L-lysine. 
Haab et al}^ found that only 20% of the arrayed antibodies pro- 
vided specific and accurate measurements of their target antigens 
at a eoneentratioii of 1 .6 pg/iiii or less, but that some antibodies 
could detect ligands at concentrations of less than 1 ng/ml and at 
partial concentrations of 1 part in 1,000,000. 

This ratiometric, two color approach was recentiy applied to 
the proteomic analysis of LoVo colon carcinoma cells in response 
to treatment with ionizing radiation'^. Proteins in lysates from 
untreated cells and cells collected 4 h after irradiation were 
labeled with Cy3 and Cy5, respectively. The samples were mixed 
and applied to a microarray composed of 146 distinct antibodies 
directed against proteins involved in stress response, cell- cycle 
progression and apoptosis. In addition to observing the upregu- 
lation of five apoptotic proteins already known to be induced by 
radiation, Sreekumar et al}^ discovered six other proteins that 
were upregulated and one that was downregulated; most of these 
results were confirmed subsequently by immunoblotting. 

This study is significant because it shows that protein- 
detecting microarrays can provide new information about pre- 
\ iously characterized proteins and thus can be used to generate 
Iresh hypotheses. But it is important to note that, although 
many different antibodies were arrayed, the researchers did little 
to validate their performance in a microarray format. It is well 
known that most antibodies crossreact significantiy with other 
cellular proteins, but none of the antibodies were tested for 
crossreactivity using cellular lysates. These arrays must therefore 
be seen as tools for discovery rather than as tools for quantitative 
analysis. One must also view the number of elements on their 
arrays with some skepticism: although 146 antibodies were 
immobilized, it is not clear what percentage of these antibodies 
generated meaningful data. 
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Fig. 2 Current immunoassay strategies used in protein-detecting microarrays. a. In a sandwich immunoassay, 

antibodies are immobilized on the solid support, and bound proteins are detected using a second, labeled dc 

ntibody. b. In an antigen capture assay, proteins are similarly captured by immobilized antibodies, but the captured gression, whereas the abun- 



matched 
epithelium, pr 

ithelial neoplasia and invasive 
prostate cancer. All three types 
of cell were isolated from 10 
individuals by laser capture 
microdissection. The cells were 
lysed, arrayed on nitrocellulose- 
coated slides and probed with 
different antibodies. Paweletz 
et alP found that the abun- 
dance of phosphorylated Erk 
decreased during disease pro- 



d directly. This is usually accomplished by chemically labeling the complex mixture of protein 
before applying them to the array In the two-color version of this assay, two samples are labeled independently with 
distinguishable f luorophores, and the samples are mixed before applying them to the array, c. In a direct assay, the com- 
plex mixture of proteins is itself immobilized on the solid support, and specific proteins in that mixture are visualized 
using labeled detection antibodies. 



dance of phosphorylated Akt 

tended to increase. Control 

experiments suggested that the 



In principle, the two-color approach permits 
quantification of the relative abundance of proteins, but it is 
hampered by an inherent lack of sensitivity. Because the 

approach does not offer signal amplification beyond that pro- 
vided by the use of tluorophores, the limit of detection is typi- 
cally no better than 1 ng/ml of analyte. Enz}'me-catalyzed signal 
amplification improves the limit of detection by at least three dec 
orders of magnitude. Taking advantage of this increased sensitiv- 1 
ity at the cost of adopting a one-color approach, Knezevic etal}'^ 
have used microarrays of 368 distinct antibodies, coupled with compk 



assay was around 1 ng/ml; how- 
ever, because these experiments 
were done with purified antigen rather than with complex lysates, 
the actual limit of detection is likely to be considerably higher. 
Using a similar approach, Madoz-Gurpide et al?'^ have described 
the benefits of simplifying cell lysates chromatographically before 
arraying them on a solid support. This strategy increases the 
:ity of direct immunoassays, but at the cost of substantially 
sing throughput. 
In a different vein, microarrays of defined antigens represent 
special type of direct immunoassay that can be used to profile 



containing immunoglobulins. For these 



laser capture microdissection, to profile proteins in both normal experiments, the immobilized antigens direct the 'capture' of 
and cancerous epithelium in the oral cavity, as well as in stromal solution-phase immunoglobulins, although in reality it is the 



cells adjacent to and surrounding these tissues. Proteins in cellu- 
lar lysates were biotinylated and applied to the arrays, and the experiments by Mendi 
captured proteins were detected using an enzyme-linked colori- 
metric assay. As with the study of Sreekumar etalP, these arrays 
must be seen as tools for discovery rather than quantitative 
analysis because little attempt was made to evaluate the perfor- 
;e of each antibody. Although only 14% of the antibodies 



immunoglobuUns that are recognizing the antigens. Early 
showed that protein antigens 



dies. More recently, 
antigen-containing 



TO arrays of either purified antigens 
have been used to profile aller- 
specific IgEs from human serum^^"^*, to characterize and 



the arrays gave consistently positive signals with any of the oral quantify autoantibody responses in individuals with autoim- 



sdiagnose people with parasitic or 



i with respect to Advantages and limitations of protein-detecting microarrays. In 



cavity tissues, Krizman and co-workers were nevertheless able to mune diseases^'''^^, and to sc 
identify 1 1 proteins that reproducibly changed in either their rel- viral infections^ 
ative abundance or their state of phosphorylatio 
disease progression. The discovery of multiple markers of disease 
progression and their facile analysis using protein- detecting electroph( 
microarrays holds great promise for diagnosis, prognosis and b 
lored medicine^''. 

In an interesting twist on the issue of direct detection, Bel( 
et al}^ have used a microarray of 60 antibodies directed against lar biolog 
cluster of differentiation (CD) antigens to immunophenotype process ne 
leukocytes obtained from both normal individuals and individu- 
als with various types of leukemia. Because these antigens are 
expressed on the surface of cells, the researchers could apph* 
whole cells to their arrays and detect captured cells using dark- 
field microscopy. From the expression pattern of the CD anti- 



relatively unbiased separation methods such a: 
esis and hquid chromatography, affinity-based 
approaches enable the investigator to direct the experiment. This 
why microarray technology is best suited to systems- oriented 
ligations. If the goal of the experiment is to study a particu- 
rocess, only those proteins involved in that 
process need to be examined. The downside, of course, is that 
these investigations require both a prior knowledge of the pro- 
teins to be studied and appropriate affinity reagents. This high- 
lights the importance of efforts directed at the rapid selection of 
recombinant antibodies, antibody substitutes and small- 
molecule ligands, and it will be interesting to see how these tech- 
gens, they defined a fingerprint that is diagnostic of chronic nologies develop over the next few years.What, then, are the rela- 



lymphocytic leukemia and are currentiy using their array 
establish fingerprints for other types of leukemia. 
Direct immunoassays. In a third approach to protein profiling, it 
) immobilize the samples themselves, rather tlian the 



merits of each type of protein- detecting microarray, and 
which approach is best? In my (current) estimation, direct 
says are not likely to see widespread use. Although this 
idily permits signal amplification, pre 



affinity reagents, on the sohd support Specific proteins in these abundance are difficult to observe because they must compete 
samples can then be detected and quantified by probing with with more abundant proteins for immobilization on the solid 
labeled antibodies. Paweletz et alP have used this approach to support. The technique, like the dot-blot that it mimics, is also 
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Fig. 3 Complications arising from regulated protein-protein interactions, a. In 
an antigen capture assay, a more intense signal is observed when the target 
protein is involved in a multiprotein complex (condition 2) than when it is free 
in solution (condition 1). b, In a sandwich assay, a similar problem occurs if 
detection antibodies are present that recognize other components of the mul- 
tiprotein complex. 
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limited by the speciticity ot the affinity reagents (see below), and 
the accuracy ol ciuaiinlicanoii is limited by the reproducibility of 
sample preparation and spotting. 

Two-color antigen capture methods are less demandins; in 
terms ot sample preparation and spotting because quantihcalion 
involves a ratiometnc readout. In its current form, tlie two-color 
method does not involve signal amplification (beyond that 
afforded by the use of fluorescent dyes) and thus has limited sen- 
sitivity (~1 ng/ml). This may be circumvented, however, by the 
use of two orthogonal enzyme- catalyzed amplification methods 
or by two-color rolling-circle amplification. Like the direct assay, 
this approach is limited by the specificity of the affinity reagents. 
In addition, the need to label proteins presents a confounding 
problem: some proteins may be labeled preferentially on their 
antigenic epitopes and so lose their ability to be captured by their 
affinity reagents. For example, my colleagues and I have encoun- 
tered proteins that are not detected using antigen capture meth- 
ods but are readily detected using the same capture antibodies in 
a sandwich-based assay (U.B. Nielsen, M.H. Cardone, A.J. 
Sinskey, G.M. and RK. Sorger, unpublished data). 

Unlike the antigen capture method, the sandwich approach 
does not require the proteins to be labeled. This simplifies sample 
preparation and vastly increases throughput, which is why this 
strategy is likely to dominate array-based diagnostics. In addi- 
tion, the sandwich approach has the distinct advantage that it 
addresses the largest problem confronting protein-detecting 
microarrays — ^that is, specificity. As anyone who has carried out 
an immunoblot knows, most antibodies crossreact with other 
cellular proteins. Both the direct assay approach and the antigen 
capture approach are analogous to a western blot in which each 
lane has been compressed into a single band; all the crossreacting 
proteins contribute to the total signal. By contrast, a sandwich 
assay is analogous to an immunoprecipitation reaction coupled 
with a compressed western blot. One is far more likely to observe 
a single band on a western blot when the product of an immuno- 
precipitation reaction is analyzed than when total cellular lysate 
is analyzed. The disadvantage of the sandwich approach, how- 
ever, is that two noncompeting affinity reagents are required for 
each protein. 



Complications from regulated protein-protein interactions. A 
potential caveat of protein- detecting microarrays is that regu- 
lated protein-protein interactions complicate the interpretation 
of data arising from either antigen capture or sandwich assays. 
For example, a protein may be free in solution under one set of 
conditions and form part of a large, multiprotein complex under 
another set. Although the abundance of the protein does not 
actually change in going from the first set of conditions to the 
second, an antigen capture assay would report a significant 
increase (Fig. 3a)^ Even if the individual carrying out the experi- 
ment were aware of this interaction, it is not clear how she or he 
would distinguish upregulation from complex formation. 

In a multiplexed sandwich assay, two potential problems con- 
front the investigator. First, the epitope recognized by the detec- 
tion reagent may become obscured in the complex, resulting in 
an apparent decrease in signal on complex formation. Second, 
the presence of detection antibodies directed against other mem- 
bers of the complex would result in an apparent increase in signal 
(Fig. 3b). In either case, however, knowledge of the biology would 
enable the investigator to circumvent these problems. Detection 
antibodies could be chosen that do not target protein-protein 
interlaces, and arrays could be subdivided physically to avoid the 
tietection of associated proteins. These complications highlight 
the neetl to understand the underlying function of the proteins 
under investigation. 

Functional proteomics 

Genome sequencing projects contribute daily to lists of the pro- 
tein components of cells and organisms; determining what these 
components do is the task of functional proteomics. As part of 
this endeavor, considerable effort is now being devoted to defin- 
ing protein-protein interactions on a system-wide or genome- 
wide basis. One technique that has been used extensively is the 
yeast two-hybrid system^**"^^. This in vivo assay, although easy to 
implement and of considerable utility, has several limitations. 
Proteins that function as transcriptional activators yield false 
positives when fused to DNA-binding domains, whereas false 
negatives arise when proteins are displayed inappropriately or 
fail to fold correctiy in yeast. More significantly, the investigator 
is unable to control either the post-translational modification 
state of the proteins under investigation or the environment 
imder which the interactions are being studied. 

Recently, immunoprecipitation coupled with mass spectrome- 
try has been used to identify multiprotein complexes on a large 
scale in Sacdwromyces cerevisiae^^'^'^. This technique, which can 
be adapted reatlily to high-throughput investigations, identifies 
proteins that purif)' together with an epitope-tagged bait protein. 
Although this is a very efficient way to identify protein com- 
plexes, it has ffie caveat that a single bait protein may occur in 
more than one complex in a cell. It may therefore bring down 
two or more proteins that never actually colocalize, giving the 
illusion that protein complexes are bigger and more elaborate 
than they really are. 

As a complement to these approaches, protein microarray 
technology provides a well- controlled, in vitro way to study func- 
tion on a system-wide or genome- wide basis. For this applica- 
tion, the proteins themselves, rather than affinity reagents, are 
arrayed on a solid support. To distinguish ffiese types of arrays 
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Fig. 4 Proof-of-concept experiments showing the detection of protein-protein interactions on a model protein func- 
tion microarray^'. Three different proteins were spotted in quadruplicate on each of five glass microscope slides, 
a. Slide probed with IgG labeled with BODIPY-FL (a fluorophore); blue spots indicate an interaction of IgG with immobi- 
lized protein G. b, Slide probed with Cy3-labeled kBa; green spots indicate an interaction of IkBu with immobilized 
p50. c,d. Slides probed with Cy5-labeled FKBP12 in the presence (c) or absence (d) of 100 nM rapamycin; red spots indi- 
cate the rapamycin-dependent interaction of FKBP12 with immobilized FRB. e. Slide probed with a mixture of all three 
labeled proteins and 100 nM rapamycin. 



from protein-detecting micro- 
arrays, I refer to them (using 
Kodadek's nomenclature^) as 
'protein function microarrays'. 

An advantage of studying 
proteins in an array format is 
that the investigator can con- 
trol the conditions of the 
experiment. This includes not 
only factors such as pH, tem- 
perature, ionic strength and 



under investigation. An early 
study by Ge'5 illustrates this 
principle. Ge prepared low-density arrays composed of 48 puri- 
fied human proteins spotted on nitrocellulose membranes and 
probed his arrays with several different radiolabeled molecules 
including proteins, nucleic acids and small organic com- 
pounds. Because his arrays featured both a phosphorylated and 
nonphosphorylated form of the protein PC4, he was able to 
show that a double-stranded DNA probe was bound more 
tightly by the phosphorylated form of the protein than by the 
unmodified one. 

Although low- density arrays such as these are useful, there is 
substantial benefit to be gained from using microarray technol- 
ogy. First, in principle thousands of proteins can be spotted on a 

lancously Ihc lunclion of many diffcrenl proteins wilh minimal 
sample consumption. Second, hundreds or even thousands of 
copies of an array can be fabricated in parallel, enabling the same 
proteins to be probed repeatedly with many different molecules 
under many different conditions. It is the combination of these 
two features that makes microarray technology so well suited to 
systems-oriented proteomics. 

In one of the earliest reports describing methods to fabricate 
microarrays of functionally active proteins, Arenkov et al'^ 
spotted purified enzymes onto glass-supported micromatrices 
of polyacrylamide gel pads and showed that the immobilized 
proteins retained their catalytic activity. Shortly thereafter, Stu- 
art Schreiber and I reported simplified methods to fabricate 
and process protein function microarrays using commercially 
available arrayers and scanners^^. We showed that purified pro- 
teins retain their activities when spotted onto chemically 
derivatized glass slides featuring amine-reactive, hydrophilic 
surfaces. Nanoliter volumes of purified proteins were spotted 
at high spatial densities, permitting over ten thousand spots to 
fit on a single glass microscope slide. Using surfaces displaying 
aldehyde groups or featuring a covalently bound layer of chem- 
ically activated serum albumin, we showed that protein func- 
tion microarrays can be used to detect stable interactions 
between two proteins, transient interactions between enzymes 
and their substrates, and both high and low affinity interac- 
tions between proteins and small molecules (see, for example. 
Fig. 4). A key feature of this study was that the reported meth- 
ods could be carried out with standard instrumentation, mak- 



ing protein microarray technology readily accessible for func- 
tional proteomics. 

Protein fiinction microarrays for focused proteomics. Array 

technology provides an ideal way to study focused collections 
of proteins or protein domains. In a study illustrating this con- 
cept, Zhu et alr'^ cloned, expressed and purified 119 of the 122 
predicted protein kinases in yeast with the goal of analyzing 
their substrate specificities. To reduce sample consumption, 
they fabricated what were, in essence, miniaturized microtiter 
plates: 'chips' featuring 10 X 14 wells with a pitch of 1.8 mm. 
They covalently coated 16 of these chips with 16 different pro- 
tein or peptide substrates and lell a seventeenth chip uncoated 
to study autophosphorylatioii. iliey then manually pipetted a 
different purified kinase, together with [y-^^PlATP, into each 
well. The wells were washed and exposed to a phosphorimager 
to detect putative substrates. As well as observing different 
degrees of specificity among the various kinases, Zhu et al?^ 
discovered that 27 of the 119 kinases could phosphorylate 
poly(Tyr-Glu). 

On the basis of sequence analysis, yeast do not encode any 
members of the conventional tyrosine kinase family; however, 
the exjieriments of Zhu et iil/* suggest tiiat, by analyzing a large 
set of yeast kinases in vitro, il may be possible to predict tyrosine 
kinases on the basis of their biochemical function. As this is a 
rather controversial subject, it will be interesting to see how 
many of these kinases actually phosphorylate tyrosine residues 
on their in vivo substrates. 

Protein fiinction microarrays for discovery-oriented pro- 
teomics. Protein inicroarrays can also be used lo sludy funclion 
on a genome-wide basis. Wilh Ibis as Iheir goal, Snyder and co- 
workers^' cloned 5,800 of the roughly 6,200 yeast open reading 
frames into a yeast high-copy expression vector carrying an 
amino-terminal tag of glutathione S-transferase (GST) plus 
oligohistidine. They expressed the proteins in 96-well blocks 
and purified them by affinity chromatography in microtiter 
plates. The purified material was then spotted in duplicate onto 
nickel-coated microscope slides to produce what they termed a 
yeast 'proteome chip' (Fig. 5). When they probed one of their 
chips with labeled calmodulin, they were able to identify 6 of 
the 12 known yeast calmodulin targets and an additional 33 
potential targets. 

A unique advantage of protein function microarrays is that 
they can be used to study the interaction of proteins with non- 
proteinaceous molecules, including nucleic acids, hpids and 
small organic compounds. To identify yeast proteins that bind 
closely related phospholipids, Snyder and co-workers^' probed 
six of their proteome chips with labeled hposomes, identifying 
150 putative phosphohpid-binding proteins. By comparing the 
results from each independent experiment, they were able to 
determine which interactions were specific for a given lipid and 
which interactions were not specific. Curiously, of the 48 pro- 
teins in yeast that are predicted to contain either PH, PX or FYVE 
domains (the best- characterized of the lipid-binding 
domains*'^), only 1 protein (OPYl) was identified in their 
screens. It is likely that many more yeast proteins recognize the 
phospholipids imder investigation, but were either not present in 
sufficient amounts on the arrays or not functional. Nevertheless, 
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Fig. 5 A 'proteome chip' composed of 
6,566 protein samples representing 
5,800 unique proteins, which are spot- 
ted in duplicate on a single nickel- 
coated glass microscope slide^'. The 
immobilized GST fusion proteins were 
detected using a labeled antibody 
against GST 




[iroteins. Solubility alone, 
although informative, does not 
indicate that a protein is folded 



the ability to assess specificity represents a key feature of protein 
array technology that will no doubt prove invaluable in future 
whole-proteome experiments. By the same token, protein arrays 
should prove useful for evaluating the specificity of lead com- 
pounds in drug discovery efforts, for identifying secondary tar- 
gets of bioactive small molecules, and even for evaluating 
specificity among a set of related targets in a high-throughput 
screening format. 

Future prospects 

Protein function microarrays will benefit undoubtedly from 
improved methods of fabrication, processing and analysis, but at 
present the greatest obstacle barring their widespread use is the 
production of large collections of pure, recombinant proteins. 
Such production requires, as a necessary first step, the cloning of 
cDNAs. One a[.[.r()ach has been described in which cDNA 
libraries are screened lor prolein-producing clones using liigh- 
throughput methods*^'*^. Although this is a potentially econom- 
ical way to generate large collections of proteins and has been 
used for some discovery- oriented applications*^'**, it is limited 
in several respects. The resulting libraries are not normalized, 
they fail to contain genes that are expressed at low abundance, 
and the proteins often begin and end at random positions in the 
coding sequence. With the availability of whole-genome 
sequence data, systematic efforts are now underway to prepare 
perfectiy normalized, indexed collections of full-length open 
reading frames in recombination-based cloning vectors*^'''*. 
Resources such as these, coupled with high- throughput methods 
of expressing and purifying recombinant proteins*^'''^, should 
accelerate greatiy the application of microarray technology to 
functional proteomics. 

So what can we expect in the coming years? At the very least we 
can anticipate continued innovation. Numerous alternative 
types of array have been reported already, most notably bead- 
based 'virtual' arrays*' and live, transfected cell arrays^". Each 
approach brings with it both advantages and limitations. Regard- 
less of the approach, there is a growing need in array-based pro- 
teomics for an increased emphasis on quality rather than on 
numbers. With respect to protein- detecting microarrays, the 
crossreactivity of affinity reagents needs to be assessed and 
reported. For protein function microarrays, the purity and 
integrity of the proteins need to be determined, and their con- 
is need to be normalized. Perhaps the biggest challenge 
a feasible way to assess the functional state of purified 



ately. By paying close attention to 

both systems- oriented analyses 
and discovery- oriented expedi- 
tions should lead us ultimately 
toward a more integrated under- 
standing of biological processes. 
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